# Web Deployment
Qwen3 0.6B ONNX
Qwen3-0.6B is a lightweight large language model converted to ONNX format for web-based usage.
Large Language Model
Transformers

Q
onnx-community
5,051
8
Depthpro ONNX
DepthPro is a vision model for depth estimation, capable of predicting scene depth information from a single image.
3D Vision
Transformers

D
onnx-community
146
10
Timesformer Hr Finetuned K600
TimeSformer-HR is a video action recognition model optimized for high-resolution videos and fine-tuned on the Kinetics-600 dataset.
Video Processing
Transformers

T
onnx-community
17
0
Timesformer Hr Finetuned K400
TimeSformer-HR is a high-resolution spatiotemporal Transformer model for video, fine-tuned on the Kinetics-400 dataset, suitable for video action recognition tasks.
Video Processing
Transformers

T
onnx-community
17
0
Timesformer Base Finetuned Ssv2
TimeSformer is a Transformer-based video understanding model specifically optimized for temporal action recognition tasks.
Video Processing
Transformers

T
onnx-community
17
0
Timesformer Base Finetuned K400
TimeSformer is a Transformer-based video understanding model, specifically fine-tuned on the Kinetics-400 dataset.
Video Processing
Transformers

T
onnx-community
17
0
Clip Vit Large Patch14
OpenAI's open-source CLIP model, based on Vision Transformer (ViT) architecture, supporting joint understanding of images and text.
Text-to-Image
Transformers

C
Xenova
17.41k
0
Sam Vit Large
Large-scale image segmentation model based on Vision Transformer architecture, capable of generating high-quality object masks from input points
Image Segmentation
Transformers Other

S
Xenova
34
0
Clip Vit Base Patch32
CLIP model developed by OpenAI, based on Vision Transformer architecture, supporting joint understanding of images and text
Text-to-Image
Transformers

C
Xenova
177.13k
8
Clip Vit Base Patch16
OpenAI's open-source CLIP model, based on Vision Transformer architecture, supporting cross-modal understanding of images and text
Text-to-Image
Transformers

C
Xenova
32.99k
9
Whisper Tiny
Whisper Tiny is a lightweight speech recognition model open-sourced by OpenAI, suitable for web deployment.
Speech Recognition
Transformers

W
Xenova
21.70k
8
Detr Resnet 50
An end-to-end object detection model based on Transformer architecture, eliminating the need for anchor box designs in traditional object detection
Object Detection
Transformers

D
Xenova
5,261
16
Featured Recommended AI Models